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CLONING AND EXPRESSION OF REN I L LA LDCIFERASE 



BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates to the field of genetic 
engineering and is particularly related to the 
expression of proteins by techniques involving genetic 
engineering. 



15 . Description of the Background 

The Renilla, also known as sea pansies, belong 
to a class of coelenterates known as the anthozoans. 
In addition to Renilla , other representative 
bioluininescent genera of the class Anthozoa include 

20 Cavarnularia. Ptilosarcus , Stylatula , . Acanthoptilum , 
and Para2oanthus . All of these organisms are 
bioluminescent and emit light as a result of the action 
of an enzyme (luciferase) on a substrate (luciferin) 
under appropriate biological conditions. Prior studies 

25 have demonstrated that all of the above-mentioned 
anthozoans contain similar lucif erases and 
luciferins. See, for example, Cormier et al., j. Cell. 
Physiol. (1973) 81: 291-298. The lucif erases and 
luciferins from each of these anthozoans will cross- 

10 react with one another to produce the characteristic 
blue luminescence observed in Renilla extracts. Each 
of these luciferases has similar biochemical 
properties, and the biochemical requirements for 
bioluminescence are identical regardless of the 

5 anthozoan from which the luciferase was derived. 

There has been considerable interest of late 
in replacing radioactive labels used in analytical 
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assays with other types, such as luminescent labels. 
Firefly lucif erase, which is a molecule of •> 
significantly different structure that does not react 
with fienilla-like luciferins, is one molecule that has 
5 been proposed for use as such labels. However, firefly 
luciferase suffers from a number of deficiencies that 
make this molecule less than optimal in biological 
assays. For example, ATP acts as a trigger of the 
firefly luciferase system, and the ubiquitous nature of 
10 ATP makes control of this variable difficult. 

A. prior patent application by one of the 
present inventors, U.S. Patent Application Serial No. 
059,137, filed June 5, 1987, describes use of 
coelenterate-derived luciferases and photoproteins as 
15 bioluminescent labels. Other applications by the same 
inventor, for example, D.S. Application Serial Nos. 
173,045, filed March 17, 1988, and 165,422, filed 
February 29, 1988, describe recombinant DNA capable of 
expressing the photoprotein apoaequorin. 
20 Tii e photoprotein aequorin (which consists of 

apoaequorin bound to a coelenterate luciferin molecule) 
and Renilla luciferase both utilize the same 
coelenterate luciferin, and the chemistry of light 
emission in both cases has been shown to be the same. 
25 However, aequorin luminescence is triggered by calcium, 
does not require dissolved oxygen, and represents a 
single turnover event. In contrast, Renilla luciferase 
is not triggered by calcium and requires dissolved 
oxygen in order to produce light in the presence of 
30 coelenterate luciferin. Renilla luciferase also acts 
as a true enzyme, catalyzing a long-lasting 
luminescence in the presence- of saturating levels of 
luciferin. 

Sub-attomole levels of aequorin can be 
35 detected with photometers even though its luminescence 
represents a single turnover event. Renilla 
luciferase, because of its enzymatic ability, should be' 
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detectable at levels 1 to 2 orders of magnitude lower 
than aequorin. Furthermore, Renilla luciferase is 
known to be relatively stable to heat, an important 
consideration for assays that often involve incubation 
5 at physiological temperatures. Accordingly, Renilla 

luciferase is a potentially useful label for biological 
and other assays. 

On the other hand, Renilla live on the ocean 
bottom, about 30 to 100 feet deep, and must be 
10 collected by dregging. From 1 kg of Renilla (about 
1000 animals), approximately 1 mg of pure Renilla 
luciferase can be obtained following a tedious 
procedure which requires purifying the protein about 
12,000 fold. The purification procedure is described 
in Matthews et al., Biochemistry (1977) 16: 85-91. As 
a result, there has been no development of Renilla 
luciferase as a detectable label. 

Accordingly, improved techniques for the 
production of pure Renilla luciferase are necessary 
before this molecule can be used commercially in 
bioluminescence assays. 



15 



20 



BRIEF DESCRIPTION OF TEE DRAWINGS 
The invention will be better understood by 
25 reference to the following detailed description and 

examples and the attached Figures that form part of the 
present specification, wherein: 

Figure 1 is the nucleotide sequence of a clone 
that contains a Renilla renif ormis luciferase cDNA 
30 sequence. 

Figure 2 is the amino acid sequence derived 
from the open reading frame of the Renilla luciferase 
cDNA shown in Figure 1. 

Figure 3 is the recombinant luciferase amino 
acid sequence with different types of underlining to 
show the location of peptides obtained by digestion of 
native luciferase with V-8 protease. 
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Figure 4 is a table showing the amino acid 
sequence of Renilla reniformis peptides obtained by V-8 
protease digestion. Regions of low degeneracy that 
were selected for preparation of oligonucleotide probes 
are shown by boxes. The probes are shown at the bottom 
part of the Figure. 

Figure 5 is a schematic representation of a 
restriction enzyme map for Renilla luciferase cDNA. 
The lower portion of Figure 5 is a schematic 
representation of sequencing strategy for Renilla 
luciferase cDNA. 

Figure 6 is a map of a Renilla luciferase 
expression plasmid. 

Figure 7 is a schematic diagram of the control 
15 region of the plasmid pTZRLuc-l. 

Figure 8 is a schematic diagram of the 
purification scheme used to purify recombinant 
luciferase . 

20 SUMMARY OF THE INVENTION - 

The present invention provides genetic mater- 
ial encoding Renilla luciferase. The genetic material 
can be used to produce the enzyme for use as 
luminescent tags in biolurainescence assays and for 
other purposes for which such labels are desirable. 
Additionally, the genetic material can be used as a 
source of probes that can be used in nucleic acid 
hybridization assays for the identification of other 
luciferase genes from related organisms. Fragments of 
the enzyme can be used to prepare antibodies for the 
purpose of identifying luciferase genes from related 
organisms. Specific genetic materials and luciferase 
proteins are disclosed in the following detailed 
description and examples. 
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DESCRIPTION OF SPECIFIC EMBODIMENTS 
The present inventors have identified and 
obtained for the first time genetic material encoding 
lucif erase from the coelenterate genus Renilla which 
previously has been available only in limited 
quantities, since lucif erases have a number of uses as 
a bioluminescent label and since Renilla luciferase has 
a number of properties that make it particularly useful 
as a label , availability of the enzyme in significant 
quantities in pure form provides a significant 
commercial advantage over prior sources. The Renilla 
genetic material also provides a source of nucleic acid 
probes for use in hybridization techniques that allow 
location of lucif erase genes in related organisms. The 
cDNA sequence for a clone that contains a Renilla 
reniformis lucif erase gene is set forth in Figure 1, 
with the translated cDNA amino acid sequence being set 
forth in Figure 2. The coding sequence of the clone in - 
Figure 1 begins at nucleotide 10 and continues to a 
stop codon at nucleotide 944. Figure 3 shows a 
complete recombinant Renilla luciferase amino acid 
sequence as produced by an expression system. 

The present invention has specifically 
contemplated each and every possible variation of 
polynucleotide that could be made by selecting 
combinations based on the possible codon choices listed 
in Figure 1 (with the reading frame beginning at 
position 1 of Figure 1) and in Table 1 (below), and all 
such variations are to be considered as being 
specifically disclosed and equivalent to the sequence 
of Figure 1. Codons are preferably selected to fit the 
host cell in which the enzyme is being produced, 
Selection of codons to maximize expression of proteins 
in a heterologous host is a known technique. 

Other DNA molecules that code for such pep- 
tides can readily be determined from the list of codons 
in Table 1 and are likewise contemplated as being equi- 
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valent to the DNA sequence of Figure 1. in fact, since 
there is a fixed relationship between DNA codons and 
amino acids in a peptide, any discussion in this appli- 
cation of a replacement or other change in a peptide is 
equally applicable to the corresponding DNA sequence or 
to the DNA molecule., recombinant vector, or transformed 
microorganism in which the sequence is located (arid 
vice versa ) . 
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TABLE 1 



GENETIC CODE 



10 



15 



20 



25 



30 



E) 



Alanine (Ala, A) 
Arginine (Arg, R) 
Asparagine (Asn, N) 
Aspartic acid (Asp, D) 
Cysteine (Cys, C) 
Glutamine (Gin, Q) 
Glutamic acid (Glu, 
Glycine (Gly, G) 
Histidine (His, H) 
Isoleucine (lie, I) 
Leucine (Leu, L) 
Lysine (Lys, K) 
Methionine (Met, M) 
Phenylalanine (Phe, 
Proline (Pro, P) 
Serine (Ser, S) 
Threonine (Thr, T) 
Tryptophan (Trp, " 
Tyrosine (Tyr, Y) 
Valine (Val, V) 
Termination signal 



F) 



W) 



GCA, 
AG A, 
AAC, 
GAC, 
TGC, 
CAA, 
GAA , 
GGA, 
CAC, 
ATA, 
CTA, 
AAA, 
ATG 
TTC, 
CCA, 
AGC, 
ACA, 
TGG 
TAC, 
GTA/ 
TAA, 



GCG, GCT 

CGA, CGC, CGG T CGT 



GCC, 
AGG, 
AAT 
GAT 
TGT 
CAG 
GAG 

GGC, GGG, GGT 
CAT 
ATC, 
CTC, 
AAG 



ATT 

CTG, CTT, TTA, TTG 



TTT 

CCC, CCG, CCT 

AGT, TCA, TCC, TCG, TCT 

ACC, ACG, ACT 

TAT 

GTC, GTG, GTT 
TAG, TGA 



Key: Each 3-letter triplet represents a trinucleotide 
of DNA having a 5 ' end on the left and a 3* end on the 
right. The letters stand for the purine or pyrimidine 
bases forming the nucleotide sequence: A = adenine, 
G = guanine, C = cytosine, and T = thymine. The RNA 
code is the same except that U (uracil) replaces T. 
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In addition to the specific nucleotides listed 
in Figure l r DKA (or corresponding RNA) molecules of 
the invention can have additional nucleotides preceding 
or following those that are specifically listed. For 
5 example, poly A can be added to the 3 '-terminal; a 

short (e.g., fewer than 20 nucleotides) sequence can be 
added to either terminal to provide a terminal sequence 
corresponding to a restriction endonuclease site, stop 
codons can follow the peptide sequence to terminate 
10 translation, and the like. Additionally, DNA molecules 
containing a promoter region or other control region 
upstream from the gene can be produced. All DNA mole- 
cules containing the sequences of the invention will be 
useful for at least one purpose since all can minimally 
be fragmented to produce oligonucleotide probes and be 
used in the isolation, or detection of DNA from biologi- 
cal sources. 

A number of words used in this specification 
have specific meanings in addition to their more "common 
20 meanings. "Renilla lucif erase" means the lucif erase 
enzyme isolated from a member of the genus Renilla or 
an equivalent molecule obtained from any other source 
or synthetically. By "equivalent" is meant, when 
referring to two nucleotide sequences, that the two 
nucleotide sequences in question encode the same 
sequence of amino acids. When "equivalent" is used in 
referring to two peptides, it means that the two 
peptides will have substantially the same amino acid 
sequence. When "equivalent" refers to a property, the 
property does not need to be present to the same extent 
(e.g., two peptides can exhibit different rates of the 
same type of enzymatic activity) , but the properties 
are preferably substantially the same. 
"Complementary," when referring to two nucleotide 
35 sequences, means that the two sequences are capable of 
hybridizing, preferably with less than 25%, more 
preferably with less than 15%, even more preferably 
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with less than 5%, most preferably with no mismatches 
between opposed nucleotides. Preferred hybridizing 
conditions (which are not limited to specific numbers 
of mismatches) are set forth in the Examples. The term 
"substantially" varies with the context as understood 
by those skilled in the relevant art and generally 
means at least 70%, preferably means at least 80%, more 
preferably at least 90%, and most preferably at least 
95%. The phrase "substantially identical" includes 
complete identity as well' as less than complete 
identity (e.g., of amino acid sequences or enzymatic 
activity) as established by the prior definition of 
"substantially." The term "isolated" as used herein 
refers to, e.g., a .peptide, DNA, or RNA separated from 
other peptides, DNAs, or RNAs, respectively, and being 
found in the presence of (if anything) only a solvent, 
buffer, ion or other component normally present in a 
biochemical solution of the same. "Isolated" does not 
encompass either natural materials in their native 
state or natural materials that have been separated 
into components (e.g., in an acrylaraide gel) but not 
obtained either as pure substances or as solutions. 
The phrase "replaced by" or "replacement" as used 
herein does not necessarily refer to any action that 
must take place but to the peptide that exists when an 
indicated "replacement" amino acid is present in the 
same position as the amino acid indicated to be present 
in a different formula (e.g., when leucine instead of 
valine is present at amino acid 11). 

Since the DNA sequence of the Renilla lucif- 
erase gene has been identified, it is possible to 
produce a DNA gene entirely by synthetic chemistry, 
after which the gene can be inserted into any of the 
many available DNA vectors using known techniques of 
35 recombinant DNA technology. Thus, the present inven- 
tion can be carried out using reagents, plasmids, and 
microorganisms which are freely available and in the 



20 



25 



30 



OC1D: <WO 821S673A1* 



WO 92/15673 



PCT/US91/01614 



10 



10 



25 



30 



35 



public domain at the time of filing of this patent 
application without requiring a deposit of genetic 
material. 

For example, nucleotide sequences greater thai: 
100 bases long can be readily synthesized on an Applied 
Biosystems Model 380A DNA Synthesizer as evidenced by 
commercial advertising of the same (e.g., Genetic ' 
Engineering News, November/December 1984, p. 3). such 
oligonucleotides can readily be spliced using, among 
others, the technique of preparing overlapping comple- 
mentary sequences (e.g, 1-100 of coding strand, 0-50 
and 51-150 of complementary strand, 101-200 of coding 
strand-, etc.], followed by hybridizing and ligating the 
strands. Such techniques are well known and are 
15 described in detail in, for example, Davis et al. , 

Basic Methods in Mol ecular Biology . Elsevier Science 
Publ. Co., Inc., New York (1986). The peptides can 
then be expressed in a host organism as described 
herein. 

Furthermore, automated equipment is also 
available that makes direct synthesis of many of the 
peptides disclosed herein readily available, especially 
peptide fragments of less than the entire Renilla 
lucif erase enzyme. In the same issue of Genetic 
Engineering News mentioned above, a commercially 
available automated peptide synthesizer having a 
coupling efficiency exceeding 99% is advertised (page 
34). Such equipment provides ready access to the 
peptides of the invention, either by direct synthesis 
or by synthesis of a series of fragments that can be 
coupled using other known techniques. 

In addition to the specific polypeptide se- 
quence shown in Figures 2 and 3, peptide fragments 
based on these sequences and fragments and full length 
sequences representing minor variations thereof will 
have, at least some of the biological activities of 
luciferase and will therefore be useful in appropriate ' 
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circumstances. For example, fragments of the 
luciferase enzyme sequence can readily be prepared and 
can- be screened for use as luciferin binding site 
models. Peptide synthesizers can be used to prepare 
5 small polypeptide fragments (e.g., less than 100 amino 
acids) or techniques of genetic engineering can be used 
to prepare larger fragments. A simple screening - 
procedure that will identify suitable polypeptide 
fragments consists of attaching a suitable substrate, 
10 e.g., a coelenterate luciferin molecule, to an affinity 
column and capturing peptide fragments that are 
retained .by the bound substrate. Such peptides can 
also be used (and are indeed more likely to be used) as 
immunogens for the preparation of antibodies that can 
15 be used to screen for the expression of a luciferase by 
a genetically engineered organism, in which case the 
bound substrate will be an antibody or similar molecule 
" that binds specifically to Renilla luciferase. 

The ability to prepare and select peptide 
fragments having appropriate binding affinity from a 
larger protein is well known in the art and is 
described in a number of publications, including 
patents. See, for example, U.S. Patent No. 4,629,783, 
which describes the preparation of immunologically 
active fragments of viral proteins that bind with the 
same antibodies as the entire viral protein. 

In addition, minor variations of the previous- 
ly mentioned peptides and DNA molecules are also con- 
templated as being equivalent to those peptides and DNA 
molecules that are set forth in more detail, as will be 
appreciated by those skilled in the art. For example, 
it is reasonable to expect that an isolated replacement 
of a leucine with an isoleucine or valine, an aspartate 
with a glutamate, a threonine with a serine, or a simi- 
lar replacement of an amino acid with a structurally 
related amino acid (i.e., a conservative replacement) 
will not have a major effect on the biological activity' 
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of the resulting molecule, especially if the replace- 
ment does not involve an amino acid at a binding site 
or other site of biologic activity. Furthermore, 
additional amino acids can be present at either of the 
5 two termini, or amino acids can be absent from one or 
both of the termini, as is known in the art. 

Whether a change results in a functioning- pep- 
tide can readily be determined by direct analysis for 
function in a assay that relies on ability of the 
10 modified enzyme (or fragment) to carry out the normal 
function of the natural luciferase enzyme (or 
fragment). For example, modified peptides can be 
tested for ability to catalyze the emission of light 
from coelenterate luciferin by the same techniques 
15 described below for the recombinant Renilla luciferase 
molecule. Peptides in which more than one replacement 
has taken place can readily be tested in the same 
manner. Preferred peptides differ at no more than 12, 
more preferably no more than 5, amino acids in any 
20 contiguous group of 20 amino acids. Substitutions of 
amino acids, when they occur, are preferably from 
within standard conservative groups. Standard 
conservative groups of amino acids are shown in 
parenthesis using the one-letter amino acid code: non- 
polar (A,V,L,I,P,M ); aromatic (F,T,W); uncharged polar 
(G,S,T,C,N,Q); acidic (D,E) ; basic (K,R,H) . The aroma- 
tic amino acids are sometimes considered to belong to 
the broader-defined nonpolar (F,W) or uncharged polar 
(T) groups; 

Salts of any of the peptides described herein 
will naturally occur when such peptides are present in 
(or isolated from) aqueous solutions of various pHs. 
All salts of peptides having the indicated biological 
activity are considered to be within the scope of the 
35 present invention. Examples include alkali, alkaline 
earth, and other metal salts of carboxylic acid resi- 
dues, acid addition salts (e.g., HC1) of amino resi- 
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dues, and zwitterions formed by reactions between 
carboxylic acid and amino residues within the same 
molecule. 

Although genes and corresponding proteins can 
5 be prepared by the totally synthetic techniques dis- 
cussed above, in preferred embodiments of the invention 
genetic information is obtained from natural sources 
and identified as described herein. The genetic mater- 
ial is first obtained in the form of a gene library, 
10 using any of numerous existing techniques. The first 
of these is to randomly shear genomic DNA and insert 
this sheared material into expression vectors. if 
enough recombinants are generated, there is a good 
probability of having at least one recombinant in the 
15 population which is expressing a fusion protein corre- 
sponding to the enzyme of interest. 

Another strategy for preparing gene libraries 
is to make complementary DNA (cDNA) copies of the total 
mRNA population of the organism and to clone these as 
20 recombinant molecules in expression vectors. The 

expected nature of the organism (i.e., it was expected 
to have the characteristics of a eucaryote) indicated 
that introns might be present within the coding region 
of the desired gene. Although introns do not preclude 
25 use of sheared genomic DNA, they increase the number of 
recombinants which must be screened and make further 
analyses substantially complicated. Based on this 
result, use of a cDNA library to obtain Renilla genes 
is preferred. 

30 . Such a library was generated in the laboratory 

of the inventors and screened for expression of a gene 
product having luciferase activity. Details of this 
example are set forth below, including details of the 
experiments that lead to obtaining the complete 

35 sequence of the gene. However, there is no reason to 
believe that the sequence and specific engineered 
organism prepared by the inventors is any better than ' 



WO 92/15673 



PCT/US91/01614 



14 



10 



35 



other clones that can be prepared using the guidance 
set forth in this specification. in fact, it is likely 
that expression of Renilla luciferase can be enhanced 
over that described herein by selection of other 
expression systems, as discussed in the examples below. 

Now that the sequence of Renilla luciferase 
has been determined, it is no longer necessary to go 
through these steps to obtain the genetic material of 
the present invention. The polymerase chain reaction 
(PCR) technique can now be used to isolate genes from 
■natural sources in a simpler and more direct manner. 
^The PCR technique, including use in diagnosis, is 
disclosed in U.S. Patent 4,683,202, which is herein 
incorporated by reference. Since Renilla specimens are 
15 readily available from the oceans of the world, and 
since PCR probes can be prepared using the sequences 
set^ forth in this specification, it is possible to 
obtain any desired segment of the sequences set forth 
herein using the PCR technique and naturally available 
20 sources of Renilla genomic material. A specific 

example of such a technique for isolating the Renilla 
luciferase chromosomal gene is described in the 
examples that follow. The cloned gene can then be 
inserted into commercial vectors and expressed. 

Although the techniques set forth above, when 
used in combination with the knowledge of those skilled 
in the art of genetic engineering and the previously 
stated guidelines, will readily enable isolation of the 
desired gene and its use in recombinant DNA vectors now 
that sufficient information is provided to locate the 
gene, other methods which lead to the same result are 
also known and may be used in the preparation of recom- 
binant DNA vectors of this invention. 

Expression of Renilla protein can be enhanced 
by including multiple copies of the gene in a trans- 
formed host; by selecting a vector known to reproduce 
the host, thereby producing large quantities of pro- 
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tein from exogeneous inserted DNA (such as pUC8; 
ptacl2; pIN-III-ompAl, 2, or 3; pOTS; pASl ; or pKK223- 
3); or by any other known means of enhancing peptide 
expression. 

5 One common variation is the preparation of a 

polypeptide of the invention in the form of a fused 
polypeptide. Such peptides are typically prepared by 
using the promoter region of a gene known to be ex- 
pressed in a host and inserting nucleotides that encode 
all or a major portion of the amino acid sequence of 
the invention into the genetic sequence for the host 
protein. Examples of such fused proteins -include e- 
galactosidase fused proteins, if desired, the fused 
peptide can be designed so that a site recognized by a 
proteolytic enzyme is present at the junction between 
the two fused proteins. The proteolytic enzyme can 
then be used to cleave the expressed protein so that 
the desired luciferase enzyme is available in pure 
form. 

In all cases, a Renilla luciferase will be ex- 
pressed when the DNA sequence is functionally inserted 
into the vector. By "functionally inserted" is meant 
in proper reading frame and orientation, as is well 
understood by those skilled in the art. Typically, a 
gene will be inserted downstream fronra promoter and 
will be followed by a stop codon, although production 
as a hybrid protein (possibly followed by cleavage) may 
be used, if desired. 

In addition to the above general procedures 
which can be used for preparing recombinant DNA mole- 
cules and transformed unicellular organisms in accor- 
dance with the practices of this invention, other known 
techniques and modifications thereof can be used in 
carrying out the practice of the invention. in partic- 
ular, techniques relating to genetic engineering have 
recently undergone explosive growth and development. 
Many recent U.S.. patents disclose plasmids, genetically' 
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engineering microorganisms, and methods of conducting 
genetic engineering which can be used in the practice 
of the present invention. For example, U.S. Patent 
4,273,875 discloses a plasraid and a process of isolat- 
5 ing the same. U.S. Patent 4,304,863 discloses a pro- 
cess for producing bacteria by genetic engineering in 
which a hybrid plasmid is constructed and used to - 
transform a bacterial host. U.S. Patent 4,419,450 
discloses a plasmid useful as a cloning vehicle in re- 
10 combinant DNA work. U.S. Patent 4,362,867 discloses 
recombinant cDNA construction methods and hybrid nuc- 
leotides produced thereby which are useful in cloning 
processes. U.S. Patent 4,403,036 discloses genetic 
reagents for generating plasmids containing multiple 
15 copies of DNA segments. U.S. Patent 4,363,877 disclo- 
ses recombinant DNA transfer vectors. U.S. Patent 
4,356,270 discloses a recombinant DNA cloning vehicle 
and is a particularly useful disclosure for those with 
limited experience in the area of genetic engineering 
since it defines many of the terms used in genetic 
engineering and the basic processes used therein. U.S. 
Patent '4, 336,336 discloses a fused gene and a method of 
making the same. U.S. Patent 4,349,629 discloses plas- 
mid vectors and the production and use thereof. U.S. 
Patent 4,332,901 discloses a cloning vector useful in 
recombinant DNA. Although some of these patents are 
directed to the production of a particular gene product 
that is not within the scope of the present invention, 
the procedures described therein can easily be modified 
to the practice of the invention described in this 
specification by those skilled in the art of genetic 
engineering. 

The implications of the present invention are 
significant in that useful amounts of Renilla 
luciferase and genetic material of the invention will 
become available for use in the development of hybridi- 
zation assays or in any other type of assay utilizing 
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these materials. Transferring the Renilla luciferase 
cDNA which has been isolated to other expression 
vectors will produce constructs which improve the ex- 
pression of luciferase in E. coli or express the 
5 polypeptide in other hosts. 

Particularly contemplated is the isolation of 
genes from related organisms using oligonucleotide 
probes based on the principal and variant nucleotide 
sequences disclosed herein. Such probes can be 
10 considerably shorter than the entire sequence but 
should be at least 10, preferably at least 14, 
nucleotides in length. Intermediate oligonucleotides 
from 20 to 500, especially 30 to 200, nucleotides in 
length provide particularly specific and rapid-acting 
15 probes. Longer oligonucleotides are also useful, up t. 
the full length of the gene. Both RNA and DNA probes 
can be used. 

In use, the probes are typically labelled in a 
detectable manner (e.g., with 32 P, 3 H , biotin# or av -_ 
20 din) and are incubated with single-stranded DNA or RNA 
from the organism in which a gene is being sought. 
Hybridization is detected by means of the label after 
single-stranded and double-stranded (hybridized) DNA 
(or DNA/RNA) have been separated (typically using 
25 nitrocellulose paper). Hybridization techniques suit- 
able for use with oligonucleotides are well known. 

Although probes are normally used with a 
detectable label that allows easy identification, 
unlabeled oligonucleotides are also useful, both as 
precursors of labeled probes and for use in methods 
that provide for direct detection of double-stranded 
DNA (or DNA/RNA). Accordingly, the term "oligo- 
nucleotide probe" refers to both labeled and unlabeled 
forms . 

35 In summary, the inventors have reduced the 

present invention to practice by isolating and 
sequencing a cDNA clone for Renilla reniformis 
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luciferase. The deduced amino acid sequence from this 
cDNA, beginning at the first methionine residue, 
predicts a protein of M r equal to 36 kd, which is the 
approximate size of native Renilla luciferase. The 
5 ' deduced amino acid sequence also contains within it all 
six peptide sequences from V-8 protease-digested native 
Renilla luciferase. Only one mis-match was found - 
between these two sets of amino acid data, a 
substitution of a tryptophan for a leucine present in 
10 the peptide sequence. Comparisions of the native amino 
acid composition and the predicted recombinant 
luciferase composition reveal a very high degree of 
similarity with many identities between specific amino 
acid residues. 

15 Additionally, expression of luciferase in a 

genetically engineered organism has been 
demonstrated. Luciferase activity was found in crude 
extracts of the original luciferase clone XRLuc-6. 
Subcloning the cDNA into the vector pTZ18R increased 
this activity enough to allow the purification of 
recombinant luciferase from the pTZRLuc-1 cells. 
Recombinant luciferase can be purified by a much 
simplified method from that previously used in the 
purification of native luciferase. The recombinant 
luciferase functions identically to native luciferase 
in all aspects analysed thus far. Like native, 
recombinant luciferase has an emission spectrum with a 
xraax at 480 nra and a shoulder at 400 nm. The 
absorption spectrum of recombinant luciferase is also 
30 identical to that of native. Additionally, both native 
and recombinant luciferase are very stable at 37°C for 
several hours as well as having significant stability 
at 45°C. Using the specific activity determined for 
native luciferase, protein determinations made based on 
light emission correlate very well with A 280 and Lowry 
protein determinations, suggesting that the specific 
activity of recombinant luciferase is similar to, if 
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not the same as, that of native luciferase. Finally, 
amino-terminus amino acid sequence analysis of 
recombinant luciferase shows an identical sequence to 
that of the cDNA-predicted amino acid sequence from 
residues 2 through 18. A significant amount of the 
recombinant protein is blocked at the amino terminus, 
probably by N-formyl methionine, which accounts for the 
inability to determine the amino acid at residue 1. 

The invention now being generally described, 
the same will be better understood by reference to the 
following examples which are provided for purposes of 
illustration only and are not to be considered limiting 
of the invention unless so specified. 

EXAMPLES 



Assay for Luciferase Activity 

Crude supernatants or pure recombinant 
luciferase samples {10 to 100 M l} were added to 1 ml of 
luciferase assay buffer {0.5M NaCl, 0.1M KP0 4 pH 7.6, 
lmM EDTA, 0.02% BSA, and 0.004% HaN 3 ) and vortexed in 
12 X 75 mm test tube. Synthetic benzyl luciferin {lOul 
of a 2.5 nmol/ul stock} was added to the reaction to 
give a final concentration of 2.5 X 10* 8 M, and the 
25 mixture was vortexed rigorously for 4-5 sec. The tube 
was placed immediately in a Turner Model TD-20e 
luminometer and peak light emission was determined and 
converted to photons using a 63 Ni radioactive light 
emission calibration standard. 

30 

RNA Isolation and cDNA Synthesis 

Live Renilla reniformis were collected by 
bottom trawling in shallow waters off Sapelo Island in 
the state of Georgia at the University of Georgia 
35 Marine Institute. The animals were washed thoroughly 

in fresh seawater, quick f rozen in liquid nitrogen, and 
stored at -80'C. Frozen Renilla were crushed to a fine 
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powder under liquid nitrogen with a morter and 
pestle. The powdered tissue was then homogenized with 
a Waring blender in 4 M guanidine thiocyanate, and 
total RNA was isolated as described in Chirgwin et al 
Biochemistry (1970) 18:5294-5299. Total RNA wasTh^n" ' 
passed over an oligo-dT cellulose column to obtain 
polyadenylated RNA which was stored as an ethanol - 
precipitate at -20°C. Single and double stranded cDNA 
were synthesized from poly A + RNA by modification of 
the Gubler and Hoffman method, Gubler et al.. Gene 
(1983) 25:263-269, as described below. FollowIn7"T-4 
polymerase blunting and methylation of the cDNAs, 
synthetic EcoRI linkers were blunt-end ligated. After 
digestion with EcoRI , the excess linkers were separated 
from the CDNAs by low-melt agarose gel electrophor- 
esis. Only cDNA-s greater than about 650 bp in length 
were isolated from the low melt gel. 

Construction and Screeni ng of the > q m Library 

Purified cDNA's were ligated into EcoRI- 
digested xgtll. The DNA was then packaged using x 
phage extracts (Gigapack Plus Kit, Strategene). 
Several fractions of the packaged library were titered 
in JT1088 cells; these fractions ranged from 71* to 81% 
recombinant phage as determined by the lack of IPTG- 
mducible 8-galactosidase activity. The total number 
of recombinant phage was equal to 2,1 X 10 6 pf u (plague 
forming units). The primary library was then amplified 
in 1T1088 cells and stored in 7% DMSO at -8CC. The 
titer of the amplified library was 2.5 X 10? pfu/ml and 
was approximately 65% recombinant. 

Two 17-base oligonucleotide probes were- 
synthesized based on amino acid sequence data from 
isolated peptides derived from V-8 protease digested, 
35 native Renilla lucif erase, shown in Figure 4 are the 
ammo acid sequences of the seven V-8 luciferase 
peptides. The amino acid sequences with' the lowest 
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codon redundancy were selected for synthesis of 
luciferase oligonucleotide Probe #1 and Probe #2, which 
are shown highlighted with their derived nucleotide 
sequences (lower portion of Figure 4). Probe #1 was 
derived from peptide 7 and contained 32 redundancies, 
while Probe #2, derived from peptide l f contained 64 # 
redundancies. The probes were end-labeled with T-4 
polynucleotide kinase to high specific activity {4-9 x 
10 8 cpm/ng}. Y1088 cells were infected with enough 
phage to give 3 X 10 4 pfu/plate. The infected cells 
were plated in 6 ml of top agarose onto 150 mm diameter 
Luria plates containing 50 pg/ml arapicillin. After 
overnight incubation at 37°C, the plates were chilled 
at 4°c before performing plague lifts. To eliminate 
false positive signals, duplicate nitrocellulose filter 
plague replicas were prepared from each master plate. 
Filters were processed by base treatment followed by 
neutralization in Tris buffer. 

The filters were air dried and baked at 80°C 
20 in vacuo. Prehybridization was for at least 6 hours 
37°C in 6X SSC, 50 mM Sodium Phosphate (pH 6.8), 5X 
Denhardfs, and 100 ug/ral denatured Herring sperm 
DNA. Hybridization was overnight at 37°C in 
prehybridization solution with the addition of dextran 
25 sulfate to a final concentration of 10%. The labeled 
probes were added to the hybridization solution at 1-2 
X 10 6 cpm/ml. 

Filter washes were done in the presence of 
tetramethylammonium choride under the conditions 
described for a 17-base oligonucleotide in Wood et al. , 
Proc. Nat. Acad. Sci . USA (1985) 82:1585-1588. Each 
duplicate filter was hybridized to both probes- in the 
first round of screening; in subsequent rounds, the 
duplicate filters were hybridized to either Probe #1 or 
35 Probe #2. All cDNA clones were plaque purified after 
three or four rounds of screening? phage DNA was 
isolated from each clone on glycerol step gradients as * 
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described in Grossberger, D. , Nuc, Acid. Res. (1987) 
15f 16) :6737. 

DNA Sequence Analysis 

All DNA sequence analysis was done in the M13 
vectors mpl8 and mpl9. Single stranded templates were 
prepared and dideoxynucleotide sequencing was preformed 
using a Sequenase DNA Sequencing Kit obtained from 
United States Biochemical Corporation. Sequencing 
reactions were primed using either the M13 universal 
primer, a primer which hybridized to extraneous xgtll 
DNA present in some constructs, or the oligonucleotide 
probes. Sequence data obtained from both ends of the 
cDNA was analyzed for six base restriction enzyme sites 
15 which were used to generate sequencing subclones 

(Figure 5). In this way, the entire 1.2 kb cDNA was 
sequenced on both strands (lower portion of Figure 
5). All DNA sequences and translated protein sequences 
were assembled and analyzed using MicroGenie Sequence 
Software purchased from Beckman. 
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Expression in E. coli 

The initial lucif erase cDNA clone, XRLuc-6, 
was in the expression vector Xgtll. The clone was 
25 ampliflied in Y1088 cells and the high titer stock was 
used to make lysogens in ¥1089. The XRLuc-6 lysogen 
was then grown in Luria broth plus ampicillin (50 
ug/ml) at 37°C. The cells were pelleted, resuspended 
in TE buffer, and lysed with lysozyrae (2 mg/ml). The 
cell debris was then pelleted and the supernatant was 
assayed for lucif erase activity. The 2.2kbp XRLuc-6 
insert which included .1 kb of xgtll lacZ DNA attached 
to the 3 ' end was isolated on a low-melt gel and 
subcloned into the EcoRI/SstI sites of pTZ18R 
35 (Pharmacia). This construct, pTZRLuc-1, was used in 

the expression and purification of recombinant Renilla 
lucif erase. 
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Electrophoretic and Western Analysis 

Recombinant luciferase samples were 
characterized on Commassie-s tained SDS-PAGE gels. For 
5 Western analysis, the gels were run and transferred to 
nitrocellulose filters at .30 mA in transfer buffer as 
described in Burnett, N.W., Analytical Biochemistry 
(1981) 112:195-203. The filters were blocked with 3% 
BSA and incubated with a 1/1000 dilution of polyclonal 
10 rabbit-anti-luciferase antibodies. Next, the filter 
was washed in TBS and incubated with al/2500 dilution 
of the. secondary antibody, goat-anti-rabbit IgG 
conjugated to horseradish peroxidase (Bio-Rad). 
Finally, the filter was washed in TBS and developed 
15 with HRP-Color Developing reagent (Bio-Rad). 

Emission Spectra 

Crude samples of pTZLuc-1 cells were prepared 
as described previously in this text. The sample-was 
added to 1 ml of luciferase assay buffer; 1 w l 
luciferin (917 nmol/ u l) was added at 1-2 minute 
intervals to maintain the signal. The bioluminescence 
emission spectra were obtained and corrected using an 
on-line computerized SPEX fluorimeter. Multiple 
spectra were scan averaged to give the final spectrum 
which was measured from 675 nm to 375 nm. 
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Protein Purification 

Purification of recombinant Renilla luciferase 
from pTZRLuc-1 E. coli extracts was accomplished in 
three chromatographic steps. Recombinant luciferase 
was purified from pTZRLuc-1 cells as follows: pTZRLuc- 
1 cells were grown in 20L Luria broth at 37°C at an 
OD 600 =0 - 6 at which time IPTG was added to a final 
35 concentration of 0.5 raM; cells continued to grow 
overnight at 30°C. The cells were harvested by 
centrif ligation, washed in TE, resuspended in 5 ml of 10 
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mM EDTA < P H 8) per gram of cells, and frozen at 
-20°C. in a typical purification, 15 to 30 grams of 
cells were thawed. Lysozyme was added to a final 
concentration of 4 to 6 mg/ml, and the cells were held 
on ice for 45 minutes. DNase 1 (10 to 20 mg) was added 
to the lysate which was sonicated on ice with 1 minute 
bursts from a Branson Cell Disrupter until 90% of the 
cells were lysed as evidenced by microscopic 
examination. 

The crude material was clarified by 
centrifugation at 48 X g for 30 minutes and loaded onto 
the first column. The extract was first run on a DEAE- 
Cellulose ion-exchange column followed by a G-100 
Sephadex gel filtration column and then a Benzoic Acid- 
15 Sepharose affinity column. The G-100 column was run in 
IX Renilla Standard Buffer (1.5 mM Tris, 1.0 mM EDTA pH 
7.8). The other columns were run in IX buffer and were 
eluted in 10X buffer (DEAE) or Sodium Benzoate in 10X 
buffer (Benzoic Acid-Sepharose) . The first Benzoic 
20 Acid column was eluted with 0.1 M sodium benzoate 

pulse. The second Benzoic Acid column was eluted with 
a 0 to 0.5 M sodium benzoate gradient. Protein 
determinations were made by A 28Q measurements using the 
extinction coefficient of native luciferase {^ an o i% 
- z.i), by light emission using the specific activity 
of naive luciferase {1.8 X 10 15 hv sec^ 1 mg" 1 }, or by 
Bradford assays as described in Bradford, M. , 
Analytical Biochemistry (1976) 72: 248. Absorption 
spectra were measured and collected on a Varian Model 
30 DMS-100 spectrophotometer. 

Isolation and Analysis of XRLuc-6 

The primary screen of 1 x 10 6 recombinant 
phage resulted in the isolation of nine clones which 
35 gave identical autoradiographic signals on both replica 
filters. Of the nine original positives, only five 
gave signals on the second screening, and only one of 
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the five hybridized to both probes. The other four 
hybridized only to Probe #2, which has the greatest 
sequence redundancy. Restriction enzyme analysis of 
the five clones revealed that XRLuc-3 and XRLuc-8 were 
5 identical and contained a 1.16 kb insert. xRLuc 2, 5, 
and 6 had insert sizes of 0.8, 2.34 and 1.2 kbp 
respectively. Only the XRLuc-3 and XRLuc-8 inserts ' 
could be exercised from the EcoRI cloning site by EcoRI 
digestion. The other three inserts had apparently lost 
one EcoRI linker site; , these had to be cut with EcoRI 
and Sstl. Thus, each of these cDNAs contained 1 kb of 
xgtll DNA attached at one. end. Since only XRLuc-6 
hybridized to both oligonucleotide probes and contained 
a cDNA of the size necessary to code for an 
approximately 36 kd protein, it was chosen for DNA 
sequence analysis. 

The 2.2kb EcoRI/SstI fragment, which contained 
1 kb of xgtll lac Z DNA, was subcloned into Ml 3 and 
mpl8 and m P 19 and both strands of the 1.2 kb cDNA were 
20 completely sequenced. The entire cDNA sequence is 1196 
bp, excluding the EcoRI linker (Figure 1). 
Structurally, it contains a putative initiation codon 
beginning at nucleotide 10, a stop codon at nucleotide 
944, a polyadenylation consensus sequence at nucleotide 
1170, and a short polyadenylated tail of seven 
nucleotides (Figure 1). Also shown underlined in 
Figure 1 are the two oligonucleotide hybrization sites 
located at nucleotides 537-554 (Probe #1) and 
nucleotides 820-836 (Probe #2). The loss of the EcoRI 
30 site at the 3' end of the cDNA was confirmed by the 
sequence analysis. 

The cDNA does not contain a stop codon in 
frame with and upstream from the first initiation codon 
as an indication that the protein coding region is full 
35 length. However, the coding region directs the 
recombinant systhesis of fully active Renilla 
luciferase, as discussed below. Translating the cDNA " 
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sequence into an amino acid sequence gave conclusive 
evidence that the xRLuc-6 cDNA was a Renilla luciferase 
cDNA. The translated cDNA sequence contains an open 
reading frame of 314 amino acids (Figure 2). The first 
methionine is preceded by three amino acids which may 
or may not be part of the native protein sequence- If 
the in vivo translation begins at the first methionine, 
an open reading frame of 311 amino acids results which 
codes for a protein of molecular weight (M r ) ~36 Kd. 
The M r of native Renilla luciferase has been measured 
by various methods with values ranging from 33 Kd to 38 
Kd. Comparing the amino acid composition of this 
translated amino acid sequence with that of the 
previously published native luciferase composition 
15 shows a very close homology with many identities 
between the two (Table 2). 
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TABLE 2 



Amino Acid Composition of Native and 
Recombinant Renilla reniformis Luciferase 
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Amino Acids 

Lysine 

Histidine 

Arginine 

Aspartate* 

Threonine 

Serine 

Glutamate** 

Proline 

Glycine 

Alanine 

Valine 

Methionine 

Isoleucine 

Leucine 

Tyrosine 

Phenylalanine 

Tryptophan 

Cysteine 



Residues 
(Native) 

26 

10 

12 
. 31 
9 
20 
36 
17 
19 
19 
23 

7 
20 
23 
12 
15 

7 

3 



Residues 
j Recombinant ) 

27 
10 
13 
30 

6 
19 
37 
18 
17 
19 
23 

9 
21 
22 
13 
15 
6 
3 



^Aspartate + Asparagine " : 

**Gltamate + Glutamine 

25 «? fc i? e lu< ? iferase composition data taken from Matthews 
25 et al,, Biochemistry (1977) 16: 85-91. WS 
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Further evidence that the cDNA does code for 
luciferase can be seen by comparing the V-8 protease 
peptide sequences with the translated cDNA sequence 
(Figure 3). All v-8 peptides were located on the 
carboxyl-terminal half of the translated coding region 
beginning at residue 161; several overlapped with one 
another. Except at one residue, 219, where the cDNA 
sequence predicts a tryptophan but peptide 6 sequence 
indicates a leucine at the same position, all peptides 
matched exactly to regions of the translated 
sequence. Bunching of the -peptides at one end of the ' 
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protein sequence may be due to the arnino-terminal half 
of the native protein being folded in such a way as to 
be inaccessible to V-8 protease. 

5 Expression of Recom binant Luciferase in E. coli 

The original *RLuc 6 lysogen showed low levels 
of luciferase activity as determined by light 
emission. IPTG induction of xRLuc-6 lysogens led to an 
approximate 50% decrease in activity. This result was 
10 later explained when DNA sequence data revealed that 
the 3' end of the cDNA was adjacent to the lac 2 
sequence in xgtll. Therefore, under conditions of IPTG 
induction, transcription was being forced in the wrong 
direction with respect to the luciferase cDNA 
15 orientation. Presumably, the non-induced luciferase 
expression in this construct was due to promoter 
activity from the left end of xgtli at a site which we " 
have not determined. 

The construct pTZRLuc-1 was made to simplify 
20 the isolation of DNA fragments for use as probes in 
Southern and Northen analysis (Figure 6). E. CQ n 
cells harboring this plasmid are referred to as 
PTZRLuc-1 cells. Similar to xgtll, the pTZ series 
"phagemids" contain a polylinker. site adjacent to the 
25 lac Z' gene. Expressed genes in this vector could 

potentially be expressed containing the first 10 to 15 
amino acids of B-galactosidase fused to the cDNA 
translation product. Analysis of pTZRLuc-1 cell 
supernatants for light emission showed that, relative 
to XRLuc-6, high levels of luciferase activity were 
present. Furthermore, induction of pTZRLuc-1 cells 
with 0.5 inM IPTG led to an increase in luciferase 
activity of - 5-8 fold in crude extracts. 

The bioluminescence emission spectrum from 
these crude supernatants was identical to the 
previously published bioluminescence emission spectrum 
for native Renilla luciferase. /The wavelength 
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distribution of light emission is essentially identical 
to that reported earlier. The spectrum had an emission 
maximum (xmax) at 480 nm with a slight shoulder at 400 
nm, which presumably corresponded to the luciferase- 
oxyluciferin complex neutral species excited state. 

The pTZRLuc-1 crude supernatants were further 
characterized by SDS-PAGE. The Cooma SS i e - sta i n ed- gel 
contained numerous bands, one of which ran in the 
vicinity of native luciferase. To confirm that this 
band was recombinant luciferse. Western analysis was 
performed using rabbit polyclonal antibodies raised 
against native Renilla luciferase.. The developed 
Western showed one band that migrated at the same 
position as native luciferase. No other products 
indicative of 8-galactosidase-lucif erase fusion 
polypeptide were apparent, suggesting that either any 
putative fusion protein is in too low a concentration 
to be detected .or, more likely, that no fusion protein 
is made. Though it has not been confirmed by DNA 
sequence analysis, any pTZRLuc-1 translation products 
initiating at the B-galactosidase ATG start codon 
within the. first three codons immediately adjacent to 
the first cDNA start codon may explain why we see IPTG 
induction of luciferase activity without production of 
25 a fusion product. 

IPTG induction of recombinant luciferase 
indicates that its transcription is directed by the lac 
2 promoter, since the only candidate ribosome binding 
site (RBS) is probably positioned too far (18 nucleo- 
tides) from the luciferase ATG to be functional, we 
suspect that a B-galactosidase peptide is being 
translated to the stop codon immediately adjacent to 
the luciferase ATG. The translation of a 8- 
galactosidase peptide may facilitate ribosome 
reinitiation at the luciferase ATG codon (Figure 7). 
This event could occur if the dinucleotide AG was 
acting as a RBS for the luciferase cDNA. in this way 
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an IPTG inducible, non-fusion luciferase polypeptide 
could be synthesized. Given the success of recombinant 
luciferase expression using the pTZ18 vector, which was 
designed as a multi-purpose in vitro transcription 
5 vector rather than an expression vector, it is obvious 
that other clones can be developed which express 
luciferase at levels greater than those which we " 
currently obtain. 

10 Purifica tion of Recombinant Renilla Luciferase 
Using the specific activity for native 
luciferase, we made calculations for the amount of 
luciferase present in IPTG induced, v pTZRLuc-1 crude 
supernatants and determined that the amount of 
recombinant luciferase being produced was sufficient to 
attempt initial purification on a small scale. 

In IPTG induced pTZRLuc-1 cells, recombinant 
luciferase represents approximately 12-14% of the total 
protein in the clarified crude supernatant.' Although 
significant losses of recombinant luciferase were 
suffered in this initial purification, the amount of 
starting material and time involved made the loss seem 
insignificant when compared to the purification of 
native luciferase. The purification scheme for the 
25 recombinant Renilla luciferase is shown in Figure 8; 
the purification is summarized in Table 3. SDS-PAGE 
analysis of the purification steps shows increasing 
amounts of recombinant luciferase with respect to 
contaminating protein. The Benzoic Acid-Sepharose 
luciferase is approximately 99% pure as evidenced by a 
single band of M r equal to 34 Kd. Very slight 
contamination, was noticible on the Coomassie stained 
gel if more than 20 yg of protein were loaded. 

35 
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important feature for -the utility of recombinant 
lucif erase in diagnostic applications, many of which 
require incubation at physiological temperatures. 

Purification of recombinant Renilla luciferase 
has allowed us to determine its amino-terminal 
sequence. The amino acid sequence of the first 18 
residues was determined by Edman degradation. The 
amino acid peak heights of the sequence data indicated 
that far less protein was actually being sequenced than 
was initially protein synthesis, it may be that a large 
percentage of recombinant luciferase is N-formylated at 
the initiating methionine and thus blocked to the Edman 
reaction. In spite of this apparent amino-terminal 
block, enough unblocked species were available that we 
15 were able to obtain sequence for the first 18 residues 
of recombinant luciferase. The amino acid sequence is 
identical to the. translated cDNA sequence from residues 
2 through 18 (Figure 7). On cycle 1 of the amino acid 
sequencing run, we were not able to confirm the 
20 presence of a methionine at the first residue as 

predicted by the cDNA sequence. However, the fact that 
the two sets of amino acid data are identical from 
amino acid residue 2 (Threonine) to residue 18 
(Proline) strongly supports our assertion that the 
25 first predicted methionine in our sequence is acting as 
the initiation codon in the pTZRLuc-1 construct. 

All publications and patent applications 
mentioned in this specification are herein incorporated 
by reference to the same extent as if each individual 
publication or patent application was specifically and 
individually indicated to be incorporated by reference. 

The invention now being fully described, it 
will be apparent to one of ordinary skill in the art 
35 that many changes and modifications can be made thereto 
without departing from the spirit or scope of the 
appended claims. 
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reaction, in spite of this apparent amino-terminal 
block, enough unblocked species were available that we 
were able to obtain sequence for the first 18 residues 
of recombinant luciferase. The amino acid sequence is 
identical to the. translated cDNA sequence from residues 
2 through 18 (Figure 7). On cycle 1 of the amino acid 
sequencing run, we were not able to confirm the 
presence of a methionine at the first residue as 
predicted by the cDNA sequence. However, the fact that . 
the two sets of amino acid data are identical from 
amino acid residue 2 (Threonine) to residue 18 
(Proline) strongly supports our assertion that the 
first predicted methionine in our sequence is acting as 
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WHAT IS CLAIMED IS : 

1. An isolated DNA or RNA molecule, which 
comprises a nucleotide sequence coding for Renilla 
lucif erase . 

2. The molecule of Claim 1, wherein said molecule 
comprises the luciferase coding sequence: 

AGCTTAAAGATGACTTCGAAAGTTTATGATCCAGAAGAAAGGAAACGGAT 

GATAACTGGTCCGCAGTGGTGGGCCAGATGTAAACAAATGAATGTTCTTG 

ATTCATTTATTAATTATTATGATTCAGAAAAACATGCAGAAAATGCTGTT 

ATTTTTT TACATGGTAACGCGGCCTCTTCTTATTTATGGCGACATGTTGT 

GCCACATATTGAGCCAGTAGCGCGGTGTATTATACCAGATCTTATTGGTA 

TGGGCAAATCAGGCAAATCTGGTAATGGTTCTTATAGGTTACTTGATCAT 

TACAAATATCTTACTGCATGGTTTGAACTTCTTAATTTACCAAAGAAGAT 

CATTTTTGTCGGCCATGATTGGGGTGCTTGTTTGGCATTTCATTATAGCT 

ATGAGCATCAAGATAAGATCAAAGCAATAGTTCACGCTGAAAGTGTAGTA 

GATGTGATTGAATCATGGGATGAATGGCCTGATATTGAAGAAGATATTGC 

GTTGATCAAATCTGAAGAAGGAGAAAAAATGGTTTTGGAGAATAACTTCT 

TCG TG G AAAC CATGTTG CC ATC AAAAATCATG AG AAAG TT AG AACC AG AA 

GAATTTGCAGCATATCTTGAACCATTCAAAGAGAAAGGTGAAGTTCGTCG 

TCCAACATTATCATGGCCTCGTGAAATCCCGTTAGTAAAAGGTGGTAAAC 

CTGACGTTGTACAAATTGTTAGGAATTATAATGCTTATCTACGTGCAAGT 

GATGATTTACCAAAAATGTTTATTGAATCGGATCCAGGATTCTTTTCCAA 

TGCTATTGTTGAAGGCGCCAAGAAGTTTCCTAATACTGAATTTGTCAAAG 

T AAAAG GTCTT C ATTTTTCGC AAG AAG ATGC AC CTG ATG AAATGGG AAAA 

TATATCAAATCGTTCGTTGAGCGAGTTCTCAAAAATGAACAA 

or an equivalent DNA or RNA sequence. 

3. The molecule of Claim 2, wherein said .molecule 
is DNA. 

4. The molecule of Claim 3, wherein said molecule 
contains said luciferase sequence. 
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5. The molecule of Claim 2, wherein said molecule 
is RNA and contains a sequence equivalent to said 
lucif erase sequence. 

6. The molecule of Claim 1, wherein said sequence 
is preceded by a functional promoter sequence 5' to 
said sequence. 

7. The molecule of Claim 6, wherein at least one 
copy of said sequence is present in a recombinant DNA 
or RNA vector. 



8. A genetically engineered microorganism, 
15 wherein said microorganism comprises the vector of 
Claim 7. 



9. The microorganism of Claim 8, wherein said 
microorganism is an E. coli strain. 
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10. An isolated oligonucleotide, comprising at 
least 10 consecutive nucleotides selected from nucle- 
otide sequence: 

AGCTTAAAGATGACTTCGAAAGTTTATGATCCAGAACAAAGGAAACGGAT 
GATAACTGGTCCGCAGTGGTGGGCCAGATGTAAACAAATGAATGTTCTTG 
ATTCATTTATTAATTATTATG ATTCAGAAAAAC ATG CAGAAAATG CTG TT 
GCCACATATTGAGCCAGTAGCGCGGTGTATTATACCAGATCTTATTGGTA 
TGGGCAAAT C AG G CAAATCTG G TAATGGTTCTTATAGG TT ACTTG ATC AT 
TACAAATATCTTACTGCATGGTTTGAACTTCTTAATTTACCAAAGAAGAT 
CATTTTTGTCGGCCATGATTGGGGTGCTTGTTTGGCATTTCATTATAGCT 
ATG AG CAT CAAGATAAG ATCAAAG CAATAG TT CACGCTG AAAG TGT AGTA 

GATGTGATTGAATCATGGGATGAATGGCCTGATATTGAAGAAGATATTGC 
GTTGATCAAATCTGAAGAAGGAGAAAAAATGGTTTTGGAGAATAACTTCT 
TCGTGGAAACCATGTTGCCATCAAAAATCATGAGAAAGTTAGAACCAGAA 
GAATTTGCAGCATATCTTGAACCATTCAAAGAGAAAGGTGAAGTTCGTCG 
TCCAACATTATCATGGCCTCGTGAAATCCCGTTAGTAAAAGGTGGTAAAC 



3CID: <WO 82i5673A1> 



WO 92/15673 



PCI7US91/01614 



35 

CTGACGTTGTACAAATTGTTAGGAATTATAATGCTTATCTACGTGCAAGT 
GATGATTTACCAAAAATGTTTATTGAATCGGATCCAGGATTCTTTTCCAA 
TGCTATTGTTGAAGGCGCCAAGAAGTTTCCTAATACTGAATTTGTCAAAG 
TAAAAGGTCTTCATTTTTCGCAAGAAGATGCACCTGATGAAATGGGAAAA 
5 TATATCAAATCGTTCGTTGAGCGAGTTCTCAAAAATGAACAATAATTACT 
TTGGTTTTTTATTTACATTTTTCCCGGGTTTAATAATATAAATGTCATTT 
TCAACAATTTTATTTTAACTGAATATTTCACAGGGAACATTCATATATGT 
TGATTAATTTAGCTCGAACTTTACTCTGTCATATCATTTTGGAATATTAC 
CT CTTT C AATG AAAC TTT AT AAA C AG TGG TT C AATTAATTAATATAT ATT 
1 0 ATAATTACATTTGTTATGTAATAAACTCGGTTTTATTATAA , 

complementary DNA sequences, and equivalent or 
complementary RNA sequences. 

15 11 • The oligonucleotide of Claim 10, wherein said 

oligonucleotide is labeled with a detectable tag. 

12. The oligonucleotide of Claim 10, wherein said 
oligonucleotide comprises at least 14 consecutive 

20 nucleotides. 

13. A method of identifying genetic material 
encoding a coelenterate luciferase gene, comprising; 

isolating genetic material from a coelenterate 
25 organism to form a sample of genetic material, . 

contacting said sample with an oligonucleotide 
of Claim 10 under hybridizing conditions, and 

detecting formation of a duplex comprising 
said oligonucleotide and DNA or RNA present in said 
30 sample. 

14. The method of Claim 13, where said method 
comprises a polymerase chain reaction. 

35 15. A genetically engineered peptide encoded by a 

nucleotide sequence of Claim 1. 
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16. The peptide of Claim 15, wherein said peptide 
is non-glycosylated. 

17. A peptide comprising the amino acid sequence cf 
Figure 3 or a fragment of said sequence comprising at Last 
5 consecutive amino acid residues in which the fragment is 
immunologically reactive with an antibody that specifically 
binds Renilla lucif erase, wherein said peptide is free from 
other Renilla peptides. 
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FIG. 2 
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1) 6LU-6LV-HLH LY5-LYS-PHE-PR0-fl$N-THR- GLU 



2) GLU-RRG-URL-LEU-LYS-RSN-GLU 

3) GLU-GLY-GLU-LYS-MET-URL-LElf-GLlI 

4) GLU-ASP-ILE-RLH-LEU-ILE-LYS-SER-GLU 

5) GLU-5ER-flSP-PRO-GLY-PHE-PHE-$ER-fl$NHLE-URL-GLU 

6) GLU-LY$-6LY-GLU-URL-flRG-RRG-PR0-THR-LEU-$ER-LEU-PR0-flR6- 
GLU-ILE-PRO-LEU-URL-LYS-GLY 



7) 



GLU-flSN-RSN-PHE-PHE-URL- GLU 



GLU-flSN-fl$N-PHE-PHE-UflL 



Lucif erase Probe #1: Gflfl-flflT- R RT-TTT-TTT- GT 
(32 degeneracies) 6 C C C C 



LYS-LYS-PHE- PRO-RSN-THR 



Luciferase Probe #2: Rflfl-flflH-TTT-CCT-flflT-RC 
(64 degeneracies) 6 G C C C 

A 
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FIG. 5 



O 3 
ULI< 



5 f 



sz 

JL 



fcr to 

IT 



in 



3' 



100 bp 



5'i 



200 bp 



I 

400 bp 



600 bp 



SOObp 



"1 

1000 bp 



-) 3 ' 

1200 bp 



FIG. 6 




OCJD: <WO 0215673A1> 



SUBSTITUTE SHEET 



WO 92/15673 



PCT/US91/01614 



7/8 



pTZRLuc-1 



Direction of translation ^ 

lac r RBS lac T start codon 

i \ 

6T6T66AATTGTGA6CGGATAACAATTTCACACAfifiAAACA6CTAI£ACCATG 
putative RBS ludlferase cDNA "start" codon 

} \ 

ATTAC6AATTTAATAC6ACTCACTATA666AATTCA£CTTAAAGAiaACTTC6 

t 

lac T in frame stop codon 

AAA6TTTATGATCCA6AA 

RBS - ribosome binding site 

FIG. 7 
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